Behavioral Profiles: a fine-grained and quantitative approach in corpus-based lexical semantics

نویسنده

  • Stefan Th. Gries
چکیده

The domain of linguistics that has probably been studied most with corpora is lexical semantics. The main assumption underlying nearly all corpus-based work in lexical (and constructional) semantics is that the distributional characteristics of a linguistic expression reveal many if not most of its semantic and functional properties. The maybe most widely-cited statement to this effect is Firth's (1957:11) famous dictum that "[y]ou shall know a word by the company it keeps." However, other quotes may be actually even more explicit and instructive, such as Bolinger's (1968:127) statement that "a difference in syntactic form always spells a difference in meaning" or Cruse's (1986:1) statement that "the semantic properties of a lexical item are fully reflected in appropriate aspects of the relations it contracts with actual and potential contexts." Most explicit in this regard is Harris (1970:785f.):

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Running Head: Behavioral Profiles 1 Behavioral Profiles: A fine-grained and quantitative approach in corpus-based lexical semantics

This paper introduces a fairly recent corpus-based approach to lexical semantics, the Behavioral Profile (BP)approach. After a short review of traditional corpus-based work on lexical semantics and its shortcomings, I explain the logic and methodology of the BP approach and exemplify its application to different lexical relations (polysemy, synonymy, antonymy) in English and Russian with an eye...

متن کامل

Inferring Semantics from Collocation Clusters to Represent Verbs and Nouns

Current lexical semantic theories provide representations at a coarse grained level. In this paper, I will provide motivations for a fine grained representation for verbs and. nouns. An initial case study is done to serve as evidence that a more detailed representation is needed for tasks that require high accuracy rates, such as machine translation. An automatic approach to gather fine grained...

متن کامل

That's So Annoying!!!: A Lexical and Frame-Semantic Embedding Based Data Augmentation Approach to Automatic Categorization of Annoying Behaviors using #petpeeve Tweets

We propose a novel data augmentation approach to enhance computational behavioral analysis using social media text. In particular, we collect a Twitter corpus of the descriptions of annoying behaviors using the #petpeeve hashtags. In the qualitative analysis, we study the language use in these tweets, with a special focus on the fine-grained categories and the geographic variation of the langua...

متن کامل

Extending Fine-Grained Semantic Relation Classification to Presupposition Relations between Verbs

In contrast to typical semantic relations between verbs, such as antonymy, synonymy or hyponymy, presupposition is a lexical relation that is not very well covered in existing lexical resources. It is also understudied in the field of corpus-based methods of learning semantic relations. But presupposition is very important for the quality of automatic semantic and discourse analysis tasks. In t...

متن کامل

A fine-grained and quantitative approach in corpus-based lexical semantics

This electronic file may not be altered in any way. The author(s) of this article is/are permitted to use this PDF file to generate printed copies to be used by way of offprints, for their personal use only. Permission is granted by the publishers to post this file on a closed server which is accessible to members (students and staff) only of the author’s/s’ institute, it is not permitted to po...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010